NUS at DUC 2007: Using Evolutionary Models of Text

نویسندگان

  • Ziheng Lin
  • Tat-Seng Chua
  • Min-Yen Kan
  • Wee Sun Lee
  • Long Qiu
  • Shiren Ye
چکیده

This paper presents our new, querybased multi-document summarization system used in DUC 2007. Current graph-based approaches to text summarization, such as TextRank and LexRank, assume a static graphmodel which does not model how input text emerges. A suitable evolutionary graph model that is related to human writing/reading process may impart a better understanding of the text and improve the subsequent summarization process. We propose a timestamped graph (TSG) model motivated by human writing and reading processes, and show how input text emerges under the construction phase of TSG. We applied TSG on both the main task and update summary task in Document Understanding Conferences (DUC) 2007 and achieved satisfactory results. We also suggested a modified MMR re-ranker for the update task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Timestamped Graphs: Evolutionary Models of Text for Multi-Document Summarization

Current graph-based approaches to automatic text summarization, such as LexRank and TextRank, assume a static graph which does not model how the input texts emerge. A suitable evolutionary text graph model may impart a better understanding of the texts and improve the summarization process. We propose a timestamped graph (TSG) model that is motivated by human writing and reading processes, and ...

متن کامل

NUS at DUC 2005: Understanding Documents via Concept Links

The primary goal of our participation in DUC 2005 is two-fold. One is to benchmark the performance of a method of computing sentence semantic similarity. The other is to test the effectiveness of a new redundancy minimization formula inspired by Maximal Marginal Relevance(MMR). By using only these two features and eschewing other heuristics, our system performed competitively, achieving the top...

متن کامل

Optimization of sediment rating curve coefficients using evolutionary algorithms and unsupervised artificial neural network

Sediment rating curve (SRC) is a conventional and a common regression model in estimating suspended sediment load (SSL) of flow discharge. However, in most cases the data log-transformation in SRC models causing a bias which underestimates SSL prediction. In this study, using the daily stream flow and suspended sediment load data from Shalman hydrometric station on Shalmanroud River, Guilan Pro...

متن کامل

A Term Frequency Distribution Approach for the DUC-2007 Update Task

We present our system used in the DUC 2007 update task, which is our first entry in any of the DUC evaluations. We make use of ideas within our existing FreqDistSumm text summarizer, which has been shown to perform well in biomedical text summarization. Our system submitted to the DUC Update Task, called FreqDistUpdate, uses a context sensitive approach to scoring sentences based on a frequency...

متن کامل

NUS at DUC 2006: Document Concept Lattice for Summarization

Concepts composed of open-class terms after semantic equivalence discovery can be considered as simplified basic elements. We utilize frequent concept sets to construct a Document Concept Lattice, which contains hierarchical summary information of a document cluster. Based on this lattice, we further extract a set of sentences with maximal representative power and minimal redundancy for summari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007